# 128k Long Context
Devstral Small 2505 GGUF
Apache-2.0
An efficient language model designed for software engineering projects, with a lightweight design and a 128k context window suited to complex coding tasks.
Large Language Model Supports Multiple Languages
Mungert
1,409
1
Devstral Small 2505 GGUF
Apache-2.0
Devstral is an agentic LLM designed for software engineering tasks, jointly developed by Mistral AI and All Hands AI. It excels at code exploration, multi-file editing, and driving software engineering agents.
Large Language Model Supports Multiple Languages
unsloth
72.26k
64
Devstral Small 2505 Unsloth Bnb 4bit
Apache-2.0
Devstral is an agentic large language model for software engineering tasks, developed in collaboration between Mistral AI and All Hands AI. It excels at using tools to explore codebases, edit multiple files, and drive software engineering agents.
Large Language Model
Safetensors Supports Multiple Languages
unsloth
873
3
Devstral Small 2505
Apache-2.0
Devstral is an agentic large language model for software engineering tasks, developed by Mistral AI in collaboration with All Hands AI. It excels at codebase exploration, multi-file editing, and driving software engineering agents.
Large Language Model
Safetensors Supports Multiple Languages
mistralai
102.17k
601
Zero Mistral 24B
MIT
Zero-Mistral-24B is an improved text-only model based on Mistral-Small-3.1-24B-Instruct-2503, primarily adapted for Russian and English, with the original visual capabilities removed to focus on text generation tasks.
Large Language Model
Transformers Supports Multiple Languages

ZeroAgency
41
2
Pixtral 12b GGUF
Apache-2.0
A multimodal large model launched by Mistral-Community, supporting image and text processing with 128k context length and variable image size handling capabilities.
Image-to-Text
lmstudio-community
611
1
Qwen2.5 The Wisemen QwQ Deep Tiny Sherlock 32B
Apache-2.0
Based on the QwQ-32B reasoning and thinking model, it incorporates features from multiple top-tier reasoning models, focusing on reducing 'overthinking' in prompts, suitable for creative use cases and in-depth reasoning.
Large Language Model
Transformers Other

DavidAU
763
4
Llama3.1 MOE 4X8B Gated IQ Multi Tier COGITO Deep Reasoning 32B GGUF
Apache-2.0
A Mixture of Experts (MoE) model with adjustable reasoning capabilities, enhancing inference and text generation through the collaboration of four 8B models.
Large Language Model Supports Multiple Languages
DavidAU
829
2
Llama SEA LION V3.5 70B R
Llama-SEA-LION-v3.5-70B-R is a hybrid-function large language model optimized for Southeast Asian languages, supporting 13 languages with capabilities in complex reasoning and general text generation.
Large Language Model
Transformers Supports Multiple Languages

aisingapore
2,406
1
Llama SEA LION V3.5 8B R
Llama-SEA-LION-v3.5-8B-R is an 8B-parameter large language model optimized for Southeast Asian languages, supporting 13 languages with capabilities in complex reasoning and general text generation.
Large Language Model
Transformers Supports Multiple Languages

aisingapore
1,975
2
Cogito V1 Preview Qwen 32B Exl2 4.65bpw
Apache-2.0
Cogito v1 Preview is an instruction-tuned generative model based on Qwen2.5-32B, supporting over 30 languages with a context length of 128k, optimized for programming, STEM, instruction following, and general assistance.
Large Language Model
Transformers

async0x42
27
3
Xlam 2 3b Fc R
The xLAM-2 series are Large Action Models (LAMs) built with advanced data synthesis and training pipelines, specializing in multi-turn conversations and tool usage, demonstrating exceptional performance in function calling and agent tasks.
Large Language Model
Transformers English

Salesforce
353
5
Qwen2.5 MOE 2X1.5B DeepSeek Uncensored Censored 4B Gguf
Apache-2.0
This is a Qwen2.5 MOE (Mixture of Experts) model, composed of two Qwen 2.5 DeepSeek (censored/regular and uncensored) 1.5B models, forming a 4B model where the uncensored version of DeepSeek Qwen 2.5 1.5B dominates the model's behavior.
Large Language Model Supports Multiple Languages
DavidAU
678
5
Llama 3.2 11b Vision R1 Distill
Llama 3.2-Vision is a multimodal large language model developed by Meta, supporting image and text inputs, optimized for visual recognition, image reasoning, and description tasks.
Image-to-Text
Transformers Supports Multiple Languages

bababababooey
29
1
Meta Llama 3.1 8B Instruct FP16
Llama 3.1 is a multilingual large language model collection developed by Meta, including 8B, 70B, and 405B parameter versions, supporting 8 languages, optimized for dialogue use cases.
Large Language Model
Safetensors Supports Multiple Languages
context-labs
565.13k
1
L3.2 Rogue Creative Instruct Uncensored 7B GGUF
Apache-2.0
A 7B-parameter uncensored creative writing model based on the Llama 3.2 architecture, supporting a 128k context length and optimized for novel writing, plot generation, and roleplaying.
Large Language Model English
DavidAU
577
7
Llama 3.2 3B Instruct SpinQuant INT4 EO8
Llama 3.2 is a collection of multilingual pre-trained and instruction-tuned generative models from Meta at 1B and 3B parameter scales, optimized for multilingual dialogue use cases and supporting 8 official languages.
Large Language Model
PyTorch Supports Multiple Languages
meta-llama
30.02k
35
Llama 3.2 3B Instruct AWQ
Llama 3.2 is a collection of multilingual large language models released by Meta, including pre-trained and instruction-tuned versions with 1B and 3B parameter scales, optimized for multilingual conversational use cases and supporting 8 official languages.
Large Language Model
Transformers Supports Multiple Languages

AMead10
4,500
2
Llama 3.2 1B Instruct
Llama 3.2 is a set of pre-trained and instruction-tuned generative models at 1B and 3B scales, optimized for multilingual dialogue use cases, including agentic retrieval and summarization tasks.
Large Language Model
Transformers Supports Multiple Languages

alpindale
31.82k
2
Vikhr Nemo 12B Instruct R 21 09 24
Apache-2.0
Vikhr-Nemo is a bilingual large language model optimized based on Mistral-Nemo-Instruct-2407, specifically designed for Russian and English, supporting various tasks such as logical reasoning, text summarization, and code generation.
Large Language Model
Transformers Supports Multiple Languages

Vikhrmodels
3,707
118
Llama 3.2 90B Vision Instruct
Llama 3.2-Vision is a multimodal large language model developed by Meta, supporting image and text input with text output, excelling in visual recognition, image reasoning, image captioning, and visual question answering tasks.
Image-to-Text
Transformers Supports Multiple Languages

meta-llama
15.44k
337
Llama 3.2 11B Vision
Llama 3.2-Vision is a series of multimodal large language models developed by Meta, available in 11B and 90B scales, supporting image + text input and text output, optimized for visual recognition, image reasoning, image captioning, and visual question answering tasks.
Image-to-Text
Transformers Supports Multiple Languages

meta-llama
31.12k
511
Llama 3.2 3B
Llama 3.2 is a multilingual large language model series developed by Meta, including 1B and 3B scale pre-trained and instruction-tuned generative models, optimized for multilingual dialogue scenarios, supporting text input/output.
Large Language Model
Transformers Supports Multiple Languages

meta-llama
602.29k
555
Llama 3.2 1B Instruct
Llama 3.2 is a multilingual large language model series developed by Meta, including 1B and 3B scale pre-trained and instruction-tuned generative models, optimized for multilingual dialogue scenarios, including agentic retrieval and summarization tasks.
Large Language Model
Transformers Supports Multiple Languages

meta-llama
2.4M
901
Llama 3.2 1B
Llama 3.2 is a multilingual large language model series launched by Meta, including 1B and 3B parameter pre-trained and instruction-tuned generative models, optimized for multilingual dialogue scenarios, including agentic retrieval and summarization tasks.
Large Language Model
Transformers Supports Multiple Languages

meta-llama
2.1M
1,866
Llama 3.1 8B Instruct GGUF
Meta Llama 3.1 8B Instruct is a multilingual large language model optimized for multilingual dialogue use cases, performing strongly on common industry benchmarks.
Large Language Model English
modularai
9.7M
4
Meta Llama 3.1 8B Instruct GGUF
A GGUF-quantized version of the Meta Llama 3.1 8B instruction-tuned model, suitable for multilingual dialogue scenarios.
Large Language Model Supports Multiple Languages
MaziyarPanahi
499.87k
19
Llama 3.1 8B
Meta Llama 3.1 is a series of multilingual large language models, including 8B, 70B, and 405B pre-trained and instruction-tuned generative models, optimized for multilingual dialogue scenarios.
Large Language Model
Transformers Supports Multiple Languages

meta-llama
1.0M
1,583